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Abstract 

In this paper we study simulation based optimization algorithms 
for solving discrete time optimal stopping problems. This type of al- 
gorithms became popular among practioneers working in the area of 
quantitative finance. Using large deviation theory for the increments 
of empirical processes, we derive optimal convergence rates and show 
that they can not be improved in general. The rates derived provide 
a guide to the choice of the number of simulated paths needed in op- 
timization step, which is crucial for the good performance of any sim- 
ulation based optimization algorithm. Finally, we present a numerical 
example of solving optimal stopping problem arising in option pricing 
that illustrates our theoretical findings. 

Keywords: optimal stopping, simulation based algorithms, entropy with 
bracketing, increments of empirical processes 

1 Introduction 

The theory of optimal stopping is concerned with the problem of choosing 
a time to take a particular action, in order to maximise an expected reward 
or minimise an expected cost. Optimal stopping problems can be found 
in many areas of statistics, economics, and mathematical finance. They 
can often be written in the form of a Bellman equation, and are therefore 
often solved using dynamic programming. Results on optimal stopping were 
first developed in the discrete case. The formulation of optimal stopping 
problems for discrete stochastic processes was in sequential analysis, an area 
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of mathematical statistics where the number of observations is not fixed in 
advance but is a random nu mber determined by the behavior of the data 
being observed. Snell ( 19521 ) was the first person to come up with results 
on optimal stoppin g theory for stochast i c pro cesses in discrete time. We 
refer to the book of Peskir and Shiryaev ( 20061 ) for a comprehensive review 
on different aspects of optimal stopping problems. 

A huge impetus to the development of optimal stopping theory was pro- 
vided by option pricing theory, developed in the late 1960s and the 1970s. 
According to the modern financial theory, pricing an American option in a 
complete market is equivalent to solving an optimal stopping problem (with 
a corresponding generalization in incomplete markets), the optimal stopping 
time being the rational time for the option to be exercised. Due to the enor- 
mous importance of the early exercise feature in finance, this line of research 
has been intensively pursued in recent times. Solving the optimal stopping 
problem and hence pricing an American option is straightforward in low 
dimensions. However, many problems arising in practice have high dimen- 
sions, and these applications have motivated the development of Monte Carlo 
methods for pricing American option. Solving a high-dimensional optimal 
stopping problems or pricing American style derivatives with Monte Carlo 
is a challenging task because the determination of the optimal value func- 
tion requires a backwards dynamic programming algorithm that appears to 
be incompatible with the forward nature of Monte Carlo simulation. Much 
research was focused on the development of fast methods to compute ap- 
proximati qns to the optimal value function . Notable examples include mesh 
m ethod o f Broadie and Glasserman (1997), t he reg r ession-based approaches 
of Carriers dl996h . Longstaff and Schwartz ( 200ll ). Tsitsiklis and Van Roy 
(1999) and lEglofil (|2005l ). All these methods aim at approximating the 
so called continuation values that can be used later to construct subop- 
timal strategies and to produce lower bounds for the optimal value function. 
The convergence an alysis for thi s type of methods was performed in sev- 
eral papers includi ng Eglo ff (2005), Egloff. Kohler and Todorovic ( 20071 ) and 
Belomestnvl feOQfll ). An alternative to trying to approximate the continua- 



tion values is to find the best value function within a class of stopping rules. 
This reduces the optimal stopping problem to a much more tractable finite 
dimensional optimization problem. Such optimization problems appear nat- 
urally if one considers finite dimensional or parametric approximations for 
the corresponding stopping regions. The latter type of algorithm s became 



articularly popular among practioneers (see e.g. Andersen ( 2000l ) or Garcia 



iool ). However, the practical success of simulation-based optimization al- 
gorithms has not been yet fully explained by existing theory, and our analysis 
here represents a further step toward an improved understanding. The main 
goal of this work is to provide rigorous convergence analysis of simulation 
based optimization algorithms for discrete time optimal stopping problems. 
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Let us start with a general stochastic programming problem 



(1.1) 



h* := minE P [/t(6> 6)1, 



where is a subset of W 71 , £ is a M. d valued random variable on the probabil- 
ity space (n, F, P) and h : l m xl d — ► R. Draw an i.i.d. sample 6 (1) , • • • , £ {M) 
from the distribution of £ and define 
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It is well known (see e.g. Shapiro ( 19931 )) that under very mild conditions it 

holds h M ~h* = P (M- 1 / 2 ).In their pioneering work|Shapiro and Homem-de-Mellol 
( 200d ) (see also Klevwegt. Shapiro and Homem-de-Mellol (|2001 )) showed that 
in the case of discrete random variable £, the convergence of hjy to h* can be 
much faster than M _1//2 , making Monte Carlo method particularly efficient 
in this situation. Turn now to the discrete time optimal stopping problem: 



(1.2) 



V = sup E[Z T 

1<t<K 



where r is a stopping time taking values in the set {1, . . . , K} and (Zfc)fc>o 
is a Markov chain. Since the random variable r takes only discrete values, 
one can ask whether the simulation based methods in the case of discrete 
time optimal stopping problem (\1.2\) can be as efficient as in the case of 
(|l.ip with discrete r.v. £. In this work we give an affirmative answer to this 
question by deriving the optimal rates of convergence for the corresponding 
Monte Carlo estimate of V based on M paths and showing that these rates 
are usually faster than M -1 / 2 . 



2 Main setup 



Let us consider a Markov chain X = (X^^q defined on a filtered probability 
space (f2, T , (Tk)k>0i Px) an d taking values in a measurable space (E,B), 
where for simplicity we assume that E = R d for some d > 1 and B = B(WL ) 
is the Borel <7-algebra on R d . It is assumed that the chain X starts at x 
under P x for some x £ E . We also assume that the mapping x i— > P X (A) is 
measurable for each A £ T . Fix some natural number K > 0. Given a set 
of measurable functions Gk k = 1, . . . , K, satisfying 



E, 



sup \G k (X k ) 

Kk<K 



< oo 



for all x E E , we consider the optimal stopping problems 
(2.3) V k *(x):= sup E kiX [G T (X T )}, k = l,...,K, 

k<r<K 
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where for any x G E the expectation in (|2.3I) is taken w.r.t. the measure 
Pk,x such that X k = x under P^ ^ an d the supremum is taken over all 
stopping times r with respect to (J r n )n>0- Introduce the stopping region 
S* = «S* x ... x S* K with S* K = E and 

SI := {x G E : F fc *(x) = 

= {xeE:E[V k * +1 (X k+1 )\F k ] <G k (x)}, k = 1, . . . , K - 1. 

Introduce also the first entry times r£ into S* by setting 

r fc * := T k (S*) := min{k < I < K : X t £ S t }. 

It is well known that the value functions V k {x) satisfy the so called Wald- 
Bellman equations 

V k *(x) = max{G k {x),V n , x [V k * +1 (X k+1 )}}, k = 1, . . . , K - 1, x£E 

with Vx(x) = Gk{x) by definition. Moreover, the stopping times T k are 
optimal in (|2.3[) . i.e. 



k = l,...,K. 



Let (^ m ^)fc=o,...,ii") m = 1, . . . , M be M independent processes with the 
same distribution as X all starting from the point x G E. We can think 
of [X k , . . . , ), fc = 0, . . . , K, as a new process defined on the product 
probability space equipped with the product measure P® . Let 25 be a 
collection of sets from the product er-algebra 

£ K := £ (8) . . . <g> 5 



A" 



that contains all sets S E 0^ of the form «S = Si x . . . x Sk—i X ^ with 
5fc G i3, = 1, — 1. Here we take into account the fact that the 

stopping set Sk must coincide with E. Let 6 be a subset of OS. Define 



f 1 M 

S M := argmax I — ^ G n(s) (x 



M 

(m) 

n(S), 



The stopping rule 

tm ■= ti(S m ) = min{l < k < K : X k E 5 M ,fc}- 

is generally suboptimal and therefore the corresponding Monte Carlo esti- 
mate 

1 N (~ \ 

(2.4) Vm, K -.= w Z^{*%) 
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with 



» ._ 



min{l <k<K: £ S M ,k}, n = 1, . . . , N 



' M 



based on a new, independent of {X^\ . . . , X^ M )) set of trajectories 



(n) 



,XP), n=l,...,N, 



fulfills 



(2.5) V M := E x \v m ,n\X {1 \. • • ,X< M )] < sup E x [G Tl{s) (X Tl(s) )} 



see 



If the set 6 is rich enough, then 

sup E x [G Tl(S ) { X n(S))] =■ E a 
see 



G n(S) ( X n(S)) ~ E x [G T1 ( S *) { X n(S*))] 



and Vm,n can serve as a good approximation for for large enough M and 
N. In the next section we are going to study the question: how fast does Vm 
converge to V* = as M — > oo ? We will show that the corresponding 
rates of convergence are always faster than usual rates M -1 / 2 . This fact has 
a practical implication since it indicates that M, the number of simulated 
paths used in the optimization step, can be taken much smaller than N, the 
number of paths used to compute the final estimate Vm n- 



3 Main results 

Definition Let 5 > be a given number and dx(-, •) be a pseudedistance 
between two elements of 03 defined as 

K 

(3.6) dx{G\ x ... x Gk, G[x ... x G' K ) = £ P x (X(t k ) € G k AG' k ), 

k=l 

where {G k } and {G' k } are subsets of E. Define N(S, &,dx) be the smallest 
value n for which there exist pairs of sets 

{Gf A x ... x Gf >K ,Gf tl x ... x Gf tK ), j = l,...,n, 

such that d x {Gf A x ... x G^, Gj^ x ... x G u jK ) < 5 for all j = 1, . . . , n, 
and for any G 6 6 there exists j(G) £ {1, . . . , n} for which 

G j(G),k ^G k C G^ G) k , k = 1, . . . , K. 

Then the value H(S,&,d) := log[N(5,&,dx)] is called the <5-entropy with 
bracketing of & for the pseudedistance dx- 



5 



Assumption We assume that the family of stopping regions © is such 
that 

(3.7) H{5,e,d x ) < A5- p 

for some constant A > 0, any < 5 < 1 and some p > 0. 

K 



Example Let 6 = 6 7 , where 6 7 is a class of subsets of M. x...xR 
with boundaries of Holder smoothness 7 > defined as follows. For given 
7 > and d > 2 consider the functions b(x\, . . . , Xd-i), b : R d_1 — ► R having 
continuous partial derivatives of order I, where I is the maximal integer that 
is strictly less than 7. For such functions b, we denote the Taylor polynomial 
of order I at a point x E by TTb,x- For a given H > 0, let £(7, -ff) be 

the class of functions 6 such that 



where ||y|| stands for the Euclidean norm of y G M d 1 . Any function b from 
£(7, H) determines a set 

5 6 := {( Xl ,...,x d ) £R d : < x d < b(x u . . . , x d _i)}. 

Define the class 

(3.8) S 7 := {S bl x . . . x S , bK _ 1 x£:ti,.., G £(7, #)}. 

It can be shown (see Dudley . 19991 . Section 8.2) that the class 6 7 fulfills 

H(8,& 1: d x ) < A5- { - K - l){d - 1]h 

for some A > and all 5 > small enough. Now we are in the position to 
formulate the main result of our study. 



Theorem 3.1. Let & be a subset o/5S such that assumption (13. 7|) is fulfilled 
with some < p < 1 and 

(3.9) E, [G n(s .) (X^)] -V< M-W+'l 

u>ii/i V" := sup 5g gEa; [G Tl (5) (X T1 (£))] and some constant D > 0. Assume 
that all functions are uniformly bounded and the inequalities 

(3.10) P s (|G fe pr fc ) - E[y fc Vi(X fc +i)|^fe]| < 6) < A , k 5 a , 5 < 6 

hold for some a > 0, Aq^ > 0, k = 1, . . . , K — 1, and 5o > 0. Then for any 
U >U and M > M 

(3.11) P® M (V* - V M > (U/M) < Cexp(-Vu/B). 
with some constants Uq > 0, Mq > 0, B > and C > 0. 
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Remark 3.2. Without condition (|3.9p the inequality (13.111) continues to hold 
with V* replaced by V, the best approximation of V* within the class of 
stopping regions 6. 

Remark 3.3. The requirement that functions are uniformly bounded can 
be replaced by the existence of all moments of Gk(Xk), k = 1,... ,K — 1, 
under P . In this case on can reformulate Theorem 16.11 using generalized en- 
tropy with bracketing inst ead of usual entropy with bracketing (see Chapter 
5.4 in lVan de Geen tood )). 



The above convergence rates can not be in general improved as shown 
in the next theorem. 

Proposition 3.4. Consider the problem (I2.3P with k = 1 and two possible 
stopping dates, i.e. r G {1,2}. Fix a pair of non- zero functions Gi,G 2 such 
that G 2 : M d -> {0, 1} and < G^x) < 1 on [0, l] d . Fix some 7 > and 
a > and Zei "Pa^ &e a class of pricing measures such that the condition 
(|3.10p is fulfilled and for any P E "Pa,7 corresponding stopping set Sp is 
m S 7 . T/ien i/iere ezisi a subset V ofV a ^ and a constant B > swc/i that 
for any M > 1, any stopping time tm £ {1,2} measurable w.r.t. J^® M 

sup \ sup Ep [G T (X T )\ - E p «m [E P G TM (X Tm )\ I > BM" TOfe^ . 
PeP [re{i,2} J 

Discussion It follows from Theorem 13.11 that 

V* - V M = P (^M~ 2 +^+»A = o P (M" 1/2 ) 
as long as a > 0. Using the decomposition 

V* — Vm,n = V* — Vm + Vm — Vm,n 
and the fact that Vm — Vm,n = Op(l/yN) for any M > 0, we conclude that 



V* - V M n = P [M 2+-d+p) + AT-2 . 



Hence, given N, a reasonable choice of M, the number of Monte Carlo paths 

2+q(l + p) 

used in the optimization step, can be defined as M x N 2 ( 1 + a ) . In the case 
when there exists a parametric family of stopping regions satisfying (|3.9[) 
(see Section H] for some examples) , one gets 

(3.12) MxJV 5 ^) 

since any parametric family of stopping regions with finite dimensional pa- 
rameter set fulfills (|3.7p for arbitrary small p > 0. Let us also make a few 
remarks on the condition (|3.1Up and the parameter a. If each function 
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Gk(x) — Efc iX [V fc * +1 (Xfc+i)], k = 1, ... ,K — 1, has a non-vanishing Jacobian 
in the vicinity of the stopping boundary dS^ and X^ has continuous dis- 
tribution, then (|3.10p is fulfilled with a = 1. In fact, it is not difficult to 
construct examples showing that the parameter a can take any value from 
If a = 1 (the most common case) (I3.12p simplifies to M x iV 3 / 4 , the 
choice supported by our numerical example. 

Finally, we would like to mention an interesting methodological connec- 
tion between our analysis and the analysis of sta tistical di scrimination prob- 

lem pe rformed in lMammen and Tsvbakovl (|1999l ) (see also lDevrove. Gvorfi and Lugosi 

In particular, we need similar results form the theory of empirical 
processes and the condition (|3.10|) formally resembles the so called "margin" 
condition often encountered in the literature on discrimination analysis. 



4 Applications 



In this section we illustrate our theoretical results by some financial applica- 
tions. Namely, we consider the problem of pricing Bermudan options. The 
pricing of American-style options is one of the most challenging problems in 
computational finance, particularly when more than one factor affects the 
option values. Simulation based methods have become increasingly attrac- 
tive compared to other numerical methods as the dimension of the problem 
increases. The reason for this is that the convergence rates of simulation 
based methods are generally independent of the number of state variables. 
In the context of our p aper we consider t he so called parametric approx- 
imation algorithms (see Glasserman . 20031 . Section 8.2). In essence, these 
algorithms represent the optimal stopping sets S£ by a finite numbers of 
parameters and then find the Bermudan option price by maximizing, over 
the parameter space, a Monte Carlo approximation of the corresponding 
value function. The important question here is wether on can parametrize 
the optimal stopping region S* by a finite dimensional set of parameters, 
i.e. S* = S(8), £ 0, where is a compact finite di mensiona l set. It turns 
out that that this is possible in many situations (see Garcia ( 200ll )). The 
assumption (13.70 and (|3.9p are then automatically fulfilled with arbitrary 
small p > 0. 



4.1 Numerical example: Bermudan max call 



This is a benchmar k example studied in lBroadie and Glassermanl (119971 ) and 
Glasserman! (|2003l ) among others. Specifically, the model with d identically 
distributed assets is considered, where each underlying has dividend yield 5. 
The risk-neutral dynamic of the asset X(t) = (X 1 (t), . . . , X d (i)) is given by 



dX l (t) 



(r-5)dt + adW\t), X l (0) = x , l = l,...,d, 



S 



where W (i), I = l,...,d, are independent one-dimensional Brownian mo- 
tions and xq, r, 5, a are constants. At any time t € {ii, fx} the holder of 
the option may exercise it and receive the payoff 



G k (X k ) := (max^ 1 ,..,!, 



where X k := X(t k ) for A; = 1, . . . , K. We take d = 2, r = 5%, 5 = 10%, 
a = 0.2, ^ = 100 x n = 9 and t fc = fcT/if, fc = 1, . . . , K, with T = 3, K = 9 



as m 



Glassermanl (|2003l . Chapter 8). 
To describe the optimal early exercise region at date t k , k = 1, . . . , K, one 
can divide M. 2 into three di fferent connected sets: one exe rcise region and two 



continuation regions (see iBroadie and Detempld (|19971 ) for more details). 



All these regions can be parameterized by using two functions depending on 
two dimensional parameter 8 k Gl 2 . Making use of t his characteri zation, we 



define a parametric family of stopping regions as in iGarcial (120011 ) via 



S k {9 k ) ■= {(xi,x 2 ) : max(max(xi,x 2 ) - K,0) > B\; \xi - x 2 \ > 9 2 k }, 

where 6 k £ Q, k = 1, . . . , K and is a compact subset of M 2 . Furthermore, 
we simplify the corresponding optimization problem by setting 9\ = . . . = 
6k- This will introduce an additional bias and hence may increase the left 
hand side of (|3.9p (see Remark l3.2p . However, this bias turns out to be rather 
small in practice. In order to implement and analyze the simulation based 
optimization based algorithm in this situation, we perform the following 
steps: 

• Simulate L independent sets of trajectories of the process (X k ) each 
of the size M: 

(Af m \...,X^ m >), m = l,...,M, 
where / = 1, . . . , L. 

• Compute estimates 0$ , . . . , 6^ via 

M 



'M 



I m=l 



• Simulate a new set of trajectories of size ./V independent of (xj!' m ^) : 

(x{ n \...,xP), n = l,...,N. 

• Compute L estimates for the optimal value function V£ as follows 

V M,N := T7 ^2 G r (I, "» ( X^L) J , l = l,...,L, 
iV «=i V t m J 
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with 
.(l,n) . 



t' m ' : = min <l < k < K : X k 



(n) 



1 M 



n 



1,...,N. 



Denote by cjm,n,i the standard deviation computed from the sample 
(G (;,„) , n = 1, . . . , N) and set a M ,N = min ; a M N l . 

Compute 



1 



HM,N,L 



L 



r(l) 



L M,N> 
1=1 



L - 1 



L 

£ 

z=i 



(0 



By the law of large numbers 



(4.13) 
(4.14) 
where 



fJ>M,N,L — > E p ®M [Vm,2v] , 

p 



7V,L 



L — > oo, 
Var p ®M [Vm,at] , L — > oo, 



V M ,N :=^XX<'/> ( X %) 

n=l v M 7 



The difference V — Vm,n with 

^ : = m a xE[G ri(5(e)) (X Tl(5(0)) )] 

can be decomposed into the sum of three terms 

(4.15) (V - E p «m [V M ]) + (E p ®m [V M ] -V m ) + V m - V m ,n- 



The first term in (|4.15p is deterministic and can be approximated by Q\{M) := 
fJ-M*,N*,L* — fJ-M,N*,L* with large enough L*, M* and N*. The variability of 
the second, zero mean, stochastic term can be measured by -^Varp^Af [Vm] 
which in turn can be estimated by Q2(M) := y/i9M,N*,L*, due to (|4.14|) . 
The standard deviation of Vm — Vm,n for any M can be approximated by 
Q 3 (N) = (Tm*,n/VN. In our simulation study we take N* = 1000000, L* = 
500, M* = 10000 and obtain V « Hm*,N*,l* = 7.96 (note that V* = 8.07 



according to iGlassermanl (|2003h ). In the left-hand side of Figure [T] we plot 
both quantities Qi(M) and Q 2 (M) as functions of M. Note that Q 2 (M) 
dominates Q\(M), especially for large M. Hence, by comparing Q 2 (M) with 
Qs(N) and approximately solving the equation Q 2 (M) = Q^(N) in N, one 
can infer on the optimal relation between M and N. In Figure Q] (on the 
right-hand side) the resulting empirical relation is depicted by crosses. Ad- 
ditionally, we plotted two benchmark curves N = M 4 / 3 and N = M 4 - 5 / 3 . 
As one can see the choice M = N 3 ^ is likely to be sufficient in this situa- 
tion since it always leads to the inequality Q\{M) + aQ 2 (M) < (tQ^{N) for 
any a > 1. As a consequence, for M = iV 3 / 4 and any N, V lies with high 
probability in the interval [/j,m,n,l* — crQs(N), fj,M,N,L* + cr Q3(A r )], provided 
that a is large enough. 
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Figure 1: Left: functions Q\{M) and Q2(M); Right: optimal empirical 
relationship between M and N (crosses) together with benchmark curves 
N = M 4 / 3 (dashed line) and N = M 4 - 5 / 3 (dotted line). 

5 Proof of main results 
5.1 Proof of Theorem 13.11 



Define 



M 



A M (S) 



£ {G T1(S) (X%> s) ) - E [G nW (X Tlis) )] } 



m=l 



and A M (S',S) := A M (S') - A M (S) for any S',S £ 6. Since 



M 



m=l 

with probability 1, it holds 



x (m L i < 



i M 



\f -n(S) \^t x {S)) - mL ^n(S M ) y X T1 (S M ) 

m=l m=l 



(m) 



(5.16) A(S M ) < A(S) + 



[A M (S*,S) + A M (S M ,S* 



>M 



with A(5) := E[G T .(X Tf )] - E[G Tl(5) (X n(5) )]. Set e M = M'W+p) then 

2 



A(«S M ) < A(5) + 



sup \A M (S*,S) 

M see:A G (s\s)<EM 



+2 x 



(5*, 5 



x sup 

<SeS: A G (S*,S)>e M 



\A M (S*,S)\ 

a£~ p \s*,s) 
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Define 



m,M-= sup \A M (S*,S)\, 

see-. A G (s*,s)<e M 

I A M (S*, S)\ 



y^2,M ■= SUp 



se&.A G (s*,s)> SM A^ 1 p) (S*,S) 

and set _4o := {Wi,Af < ^ £ m P } f° r ^ > ^o- Note that under assump- 
tion ()3.7p the condition (|6.17)) of Theorem 16.11 is fulfilled with v = 2p due 
to Corollary E3 Hence Theorem O yields P(A ) < Cexp(-C/e M 2p /C 2 ). 
Denote 



Aq(S,S') := |e G ti(<S ) - G , Tl ( <s /) (x^)) j 



1/2 



for any «S,«S' G 53. Since A(5) < DM" 1 /^) anc i e]^ p /VM = Af-V^+p), 
we get on ^4o 

A(5m) < CoM-Vd+P) + 2 x 

v Af 



with Co = D + 2U. Combining Corollary 16.31 with Corollary 16.41 leads to the 
inequality 

A G (S*,S M ) < 2V2A G v- a ^ 1+a ^A a ^ 1+ ^(S M ) 

which holds on the set Ai := {Ax(S* ,Sm) < 8a}, where 5 a and v a are 
defined in Corollary 16.41 Denote 

A 2 := [a(S m ) > C (l + x)M- 1 /( 1+ p)} 

with some x > 0. It then holds on Aq fl -4i n 

A a(l-p)/(2(l+a)) C o ■) 

A(5 M ) < 2 1 MJ W 2 , M 



and therefore 

A(«S M ) < (x/2)-^Af-^ 2 W 2 V 
with 1/ = 2 +a(i+p) • Let us now estimate P(^4i). Using Corollary 16. 4\ we get 

p® m (a x (s\s m ) ><y < 

Pr((^)A(5 M) + ^A- y >^ 

= Pf M (A(5 A/ ) > c a ) 
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with c a = 5q5o2 1 / a \1 — 2(1+0,) J • Furthermore, due to (|5.16|) 

P® M {A(S M ) > c a ) < P® M (DM-W+ti + 2M- 1 / 2 sup \A M (S)\ > c a ) 

V see J 

< P® M (sup \A M (S)\ > c a ^/M/4] 
\see / 

for large enough M. Theorem 16.11 implies 

P® M (sup |A M (5)| > c a VM/4 ) < B ie xp(-MB 2 ) 
Vsee / 

with some constants i?i > and -B2 = -62(0) > 0. Applying Theorem 16. II to 
m an d using the fact that u/2 < 1/(1 + p) for all < p < 1, we finally 
obtain the inequality 



Pf (a(5 m )>(W /2 ) < Cexp(-W/B 3 



Ue 



-Cexp I -— ^f— J +£iexp(-M£ 2 ) 

which holds for all V > Vq and M > Mq with some constant B% depending 
on x. 



5.2 Proof of Proposition HOI 

For simplicity, we give the proof only for the case d = 2 (an extension to 
higher dimensions is straightforward). In the case of two exercise dates 
the corresponding optimal stopping problem is completely specified by the 
distribution of the vector (Xi, G^^)). Because of a digital structure of 
G2 the distribution of (Xi, G^-X^)) would be completely determined if the 
marginal distribution of X\ and the probability P(G2(X2) = 1\X\ = x) are 
defined. Taking into account this, we now construct a family of distributions 
for (Xi, G^P^)) indexed by elements of the set U = {0, l} m . First, the 
marginal distribution of X is supposed to be the same for all ui G f2 and 
posseses a density p(x) satisfying 

< v* < P(x) <p*<oo, x £ [0, l] 2 . 

Let us now construct a family of conditional distributions P^G^X) = 
l\Xi = x), <jj E $7. To this end let cj) be an infinitely many times differentiable 
function on R with the following properties: (ft(z) = for \z\ > 1, <p(z) > 
for all z and sup 2eR [^(z)] < 1. For j = 1, . . . , m put 



4>j(z) := 5m 1 (f) 





2j - 11 




z 




m 
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with some < 6 < 1. For vectors to = (uji, . . . ,cj m ) of elements ujj £ {0, 1} 
and for any z£l define 



b(z, u) 



Put for any and any x £ 



3=1 



C u {x) := V u] {G 2 {X 2 ) = l\X l =x) = 

= G 1 (x)-Am-~ f/a l{0<x 2 <b{x 1 ,Lu)} 
+Am~~ f / a l (6(2:1,0;) < x 2 < SmT^} , 

where A is a positive constant. Due to our assumptions on G\(x), there are 
constants < G_ < G + < 1 such that 

G- < Gi(x) < G+, xe[0,l} 2 . 

Hence, the constant A can be chosen in such a way that C w {x) remains 
positive and strictly less than 1 on [0, l] 2 for any uj 6 f2. The stopping set 

Suj := {x : C u (x) < Gi(x)} = {(xi,x 2 ) : < x 2 < b(xi,u;)} 

belongs to S 7 since b(-,u) £ £(7, L) for 5 small enough. Moreover, for any 
77 > 

P w (|Gi(Xi)-C7 w (Xi)|<r7) = P^(0 < X 2 < 5m^)l(Am-^ a < r?) 



< ip*m- 7 l(im" 7/ " < 7?) < tfp*A" 



and the condition (|3.10p is fulfilled. Let tm be a stopping time w.r.t. J^® M ; 
then the identity (see Lemma [ 



EpJG T ,(X T ,)]-E P JG TM (X rM )] = E p J(G 1 (X 1 )-G 2 (X 2 ))1(t* = 1,t m = 2)] 

+ E P „ [(G 2 (X 2 )-Gi(Xi))1(t* = 2,r M = 1)] 
= E P „ - E(G 2 (X 2 )|^ 1 )|l{r A/ / r*}] 

leads to 

EpJG T ,(X T ,)]-E pg M {E P JG TM (X TM )]} = EpgM E P „ [|A w (Xi)|l{r M / r*}] 

with A w (x) := Gi(x) — C UJ (x). By conditioning on Xi we get 

E p « M E P ^ [lA^^OIl^ / r*}] = Am" 7 /" P(0 < X\ < 5m^) P® M (t m + r*) 

> Am^^m^ P® M (r M / r*) . 



Using now a well known Birge's or Huber's lemma, (see, e.g. lDevrove. Gvorfi and Lugosi . 
199a p. 243), we get 



sup P* m (t m ^t*)> 

JG{0,l} m 



0.36 A 1 



MK H 
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where K H := sup PQen K(P,Q), H := {P w , u € {0, l} m } and K(P,Q) is 
a Kullback-Leibler distance between two measures P and Q. Since for any 
two measures P and Q from H with Q P 



K(P,Q) < sup E 

wi^ 2 G{0,l} m 



+(l-C wl (Xi))lo to , , 



x P(0 < X\ < 5m-~<) 



A 2 m- 2 ^ a 



with some constant C > for small enough A, and log(|K|) = mlog(2), we 
get 



sup P® M (r M /r*)> 

u>G{0,l} ra 

with some constant C > 0. Hence, 



0.36 A (l - CMm^' 2 ^"- 1 



sup P® m (t m ^t*)>0 

wG{0,l} m 

provided that m = qM 1 ^' y+2 ' y ^ a+1 ^ for small enough real number q > 0. In 
this case 

sup (E P jG r *(X r *)] - EpgM {E P JG TM (X rM )]}| 
^e{o,i} m L J 

> 4p*£g _7/a_7 M- (7/a+7)/(7+27/a+1) = 5M"WW 

with 5 = Ap,^- 7 / a " 7 . 

6 Auxiliary results 

We have 

M 

Am(5) := VM^{ ff5 (xi m) ,...,^ m) )-Eb 5 (X 1 ,...,^)]} 



m=l 



with functions j^iR x...x 



defined as 



K 

K—l 



9s 



(X!,...,X K ) ■■- ^2 G k+l( x k+l)l{x 1 ?S 1 ,...,x k ?S k ,x k+1 eS k+1 }- 



k=0 



Denote Q = {gs : S £ &}. Obviously Q is a class of uniformly bounded 
functions provided that all functions Gk are uniformly bounded. 
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Definition Let Mb(S, G, P) be the smallest value of n for which there exist 
pairs of functions {[gj , gf]}™^ such that \\gV — gj\\i, 2 (p) < 5 for all j = 
1, . . . , n, and such that for each g £ Q, there is j = j(g) € {1, . . . ,n} such 
that 

flf < 9 < 9?- 

Then Hb(<5, G, P) = log [A/b(5, <7, P)] is called the entropy with bracketing of 



Th e following theorem follows directly from Theorem 5.11 in I Van de Geer 
(|2000h . 



Theorem 6.1. Assume that there exists a constant A > suc/i that 

(6.17) H B ((j 5 g,p) < at" 

for any 5 > and some > 0, where Hb(8,G,P) is the 8-entropy with 
bracketing of Q. Fix some So £ & then for any e > M~ 1 /( 2 + 1 ') 

P ( sup \A M (S) - A M (S )\ > Ue l -'i \ < CeM-Ue^/C 2 ) 

|Am(«S) - A M (5 )| 1 , rr//0 2\ 

sup 1 _ ty/2 — > (7 I < C exp(— U/C ). 

^e6,[|9s-fls [| i2 (P)<e ||5.S -55oIIl 2 (P) / 

/or all U > C and M > Mo, where C and M$ are two positive constants. 
Moreover, for any z > 

P (sup \A M (S) - A M (S )| > zVm ) < C7exp(-Mz 2 /C 2 S) 

with some positive constant B > 0. 

Let us define a pseudedistance Ax between any two sets S,S' E 03 in 
the following way 

Ax(5 1 x...x5j f) 5lx...x5^ := ^P (x k e (S fc AS£) \ [ f| <SM ] . 

k=l V V i=fc / / 

It obviously holds Ax(S, S') < dx(S, S') for the pseudodistance dx defined 
in The following Lemma will be frequently used in the sequel. 

Lemma 6.2. For any S,S' £ 23 it holds with probability one 

(6.18) G Tk{s) (X Tk[s) ) - G Tk{sl) [X Tk{s ^ 

K-l 

< l G K^0- G T, +1 (5)(^ n +i(5))l 1 {x i e(5 ! A5 i ')\(n^r ! 1 s;,)} 
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and 



(6.19) V k *(X k )-E[G Tk(s) (X Tk{s) )\F k } 



E 



K-l 



£|gkxO-e[vt + iPQ+i)I^]|i 



fork = l,...,K-l. 



Proof. We prove (|6. 19j) by induction. The inequality (I6.18|) can be proved 
in a similar way. For k = K — 1 we get 

y£_i(**-i) - v K -x{x K -x) = 

= E (Gx_i(I^-i) - G , ^(Xx))l{r; f _ 1 =x-i,r x _i=^} -^A'-i 

+ E (Gk(Xk) - GA--l(^A'-l))l{r*._ 1 =^,T K _ 1 =i^-l} -^A'-l 

= |Gif_i(Xif-i) - E[G K (X K )\J r K-i]\l{T K ^T* : _ 1 } 

since events {t^_ 1 = K} and {tk-i = K} are measurable w.r.t. Tk~\ and 
Gjt_i(^_i) > E[G K (X K )|^„i] on the set {r* K _ x =K-1}. Thus, (16~T3D 
holds with k = K — 1. Suppose that (16.190 holds with fc = iT' + 1. Let us 
prove it for k = K' . Consider a decomposition 



Gt k> ( Xt k' ) ~ Gt k> ( Xt ki < 



Si + S 2 + S 3 



(6.20) 
with 



#2 := (g t . # (X r ^ ) - G TK , (X Tk , )) l {r * />K , tTKl=K , } , 

S 3 : = (g^, (X t ^ # ) - G TK , (X Tk , )) l{r* Kl =K',r Kl >K'}- 

Using the fact that = Tk+i if T k > k for any k = 1, . . . , K — 1, we get 



E[£i|.Ffc:'] = E (Vk-z+iC^Ck-'+i) - Vk-' + i(XK-/+i)) l{ r * >k',t k ,>k 



'} 



and 

E[S 3 \F K ,] 



E 



( ^ T K'+l^' r if'+l- 



F, 



- G K >(X K r)j 1{ t * ki>K , TkI=K ,} 

(E [V^, +1 (XiC/ + i)| .Ftf/] - G/f/(X(tx'))) !{r* >i<r',r Rr ,=K'} 



Gk'(Xk') ~ E Gr K / +1 (^T K / +1 )|.7 : k' 



1 {^,=^',^'>^'} 



(Gk'^kO -E[V^, +1 (X K t + {)\T K ']) 1{t£, =!<:', r^ir 



+ E 



- Vk'+i(X K '+i)) 1{t*,=K',t k ,>K' 
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with probability one. Hence 

V K'( X K') ~ V K i(X K i) = \G K r(X K t) - E[V£, +1 (X K , +1 )\F K >]\ l{r K ,^,} 

+ E [[V£, +1 (X K >+i) - V K r + i(X K >+i)) I T K >\ 1 {t k ,>k 1 } 

since G K >(X K i)-E[V^ l+l (X K i + i) > on the set {t* k , = K'}. Our induction 
assumption implies now that 

V£,{X K .)-V K ,{X K ,) = 

K-l 

mx^-mxiixi+i^m {T-fe^r|,T fc >fe,...,r K _i>i<r-l}l^ r f!r' 

lk=K> 

and hence ([6TT9]) holds with k = K' . □ 

Corollary 6.3. If va.axk=i,...,K \\Gk\\oo < with some constant Aq > 0, 
then 



E 



A G (S,S') < 2A G ^2A X {S,S') 

for any S, S' G 25. 

Proof. Follows directly from (|6.18p since G T (X T ) < Aq a.s. for any stopping 
time r taking values in {1, ... , K}. □ 

Corollary 6.4. Assume that (|3.1U|) holds for 5 < 5q < 1/2, then there exist 
constants v a and 5 a such that 

(6.21) A{S) > v a A x +a)/a (S*,S) 

for all S G 23 satisfying Ax(S*,S) < 6 a . Moreover it holds 

f 2 l l a \ 5 

(6.22) A X (S*,S)< — A(S) + " 



<5 y v ' 2(1 + a)' 
/or any «S G 25. 

Proof. For any 5 < <5o define the sets 

A k :={xeR d :\E[V k * +1 (X k+1 )\X k = x]-G k (x)\>s], k = 1, . . . , K-l. 
Due to (|6.19p we have 

A(S) > d^p(x k G(SlAS k )\( f~]Sj)j[]A k \ 

k=l V V l=k J J 

^ | p g (n Sk )) ~ F{Ak) ) 

(6.23) > 5[A X (S*,S)-A 5 a ] 
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with Ao = Y^,k=l ^fc.Oi where A^q were defined in (|3.10f) . The maximum 
of ([OSD is attained at 5* = [A x {S*,S)/(a + l)^ ] 1/a - Since 5* < 5 for 
A X (S*,S) < Ao(a+l)5g the inequality (|6T2T1) holds with v a := A~ 1/a a(l + 
a) -1-1 /" and 5 a := Ao(cx+1)5q. The inequality (|6.22[) follows directly from 
IjCTl) by taking S = 5 /2 l l a . □ 
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